Falsification Testing of Sepsis Prediction Models: Evaluating Independent Biological Signal After Controlling for Care-Process Intensity

This pre-registered falsification study across four clinical datasets demonstrates that while sepsis prediction models at elite academic centers primarily detect genuine biological signals rather than care-process intensity, they reveal a systematic and consequential divergence between clinical sepsis definitions and administrative coding that undermines the validity of regulatory metrics and AI benchmarks relying on the latter.

Dickens, A. R.2026-03-18📄 health informatics

Context-Aware Emergency Department Triage Using Pairwise Comparisons and Bradley-Terry Aggregation

This study demonstrates that a zero-shot, pairwise comparison approach using a large language model and Bradley-Terry aggregation for emergency department triage significantly outperforms the standard Emergency Severity Index and maintains cross-site stability without requiring site-specific training, matching the performance of supervised models on external data.

Jarrett, P., Reeder, J., McDonald, S. + 2 more2026-03-17📄 health informatics

Artificial Intelligence for Automated, Highly Accurate, and Scalable Multimodal EHR Data Abstraction

The authors developed an AI-driven pipeline that leverages multimodal EHR data and an ensemble meta-learner with a dual-threshold confidence framework to automate the abstraction of Society of Thoracic Surgeons Adult Cardiac Surgery Database variables, achieving over 99% accuracy while significantly reducing the manual data collection burden across two healthcare systems.

Margaritis, G., Petridis, P., Bertsimas, D. + 5 more2026-03-17📄 health informatics

Persistent Proxy Discrimination in HIV Testing Prediction Models: A National Fairness Audit of 386,775 US Adults

This national fairness audit of 386,775 US adults demonstrates that enforcing demographic parity in HIV testing prediction models is inappropriate for differential-burden clinical contexts, as it significantly reduces screening access for high-risk populations and underscores the need for fairness metrics like equalized odds and calibration that align with clinical needs.

Farquhar, H.2026-03-16📄 health informatics

WITHDRAWN: Causal Effects of Natural Language Processing-Enhanced Clinical Decision Support on Early Cognitive Impairment Detection: A Propensity Score Analysis Using Inverse Probability of Treatment Weighting

This paper is a withdrawn study from medRxiv that claimed to analyze the causal effects of natural language processing-enhanced clinical decision support on early cognitive impairment detection, but was retracted because it was submitted with false information.

Dimitriou, A., Foster, M.2026-03-16📄 health informatics

WITHDRAWN: Blockchain-Enabled Health Information Exchange Efficiency Across South Korean Hospital Networks: A Stochastic Frontier Analysis with Bayesian Model Averaging

This withdrawn study utilized Stochastic Frontier Analysis with Bayesian Model Averaging on a panel of 247 South Korean hospital networks to demonstrate that blockchain-enabled health information exchange systems significantly improve technical efficiency compared to conventional platforms, even after controlling for endogeneity and model uncertainty.

Park, J.-H., Kim, S.-Y.2026-03-16📄 health informatics

Reward-Guided Generation Improves the Scientific Utility of Synthetic Biomedical Data

The paper introduces RLSYN+REG, a reinforcement learning-driven generative model that significantly enhances the scientific utility of synthetic biomedical data by ensuring regression models trained on it accurately reproduce the coefficients and predictions of models trained on real data, while maintaining high fidelity and privacy.

Jackson, N. J., Espinosa-Dice, N., Yan, C. + 1 more2026-03-16📄 health informatics

Predicting cognitive impairment using novel functional features of spatial proximity and circularity in the digital clock drawing test

This study demonstrates that novel mathematical functional features of spatial proximity and circularity derived from digital clock drawing tests can predict cognitive impairment with accuracy comparable to traditional summary statistics, offering a promising approach to enhance early detection strategies.

Pinheiro, A., Karjadi, C., Tripodis, Y. + 6 more2026-03-16📄 health informatics

Ollivier Ricci Curvature as a Geometric Biomarker for Biomedical Networks: From Ontology to Comorbidity Aging Trajectories

This paper establishes Ollivier–Ricci curvature as a quantitative geometric biomarker for biomedical networks by demonstrating its ability to distinguish structural phases in medical ontologies, map monotonic aging trajectories in disease comorbidity, and complement sedenion-based features in differentiating brain disorders, all while validating core mathematical claims through machine verification.

Agourakis, D. C., Gerenutti, M.2026-03-16📄 health informatics

Comparative Evaluation of Logistic Regression and Gradient Boosting Models for Influenza Outbreak Early-Warning Using U.S. CDC ILINet Surveillance Data (2010-2025)

This study demonstrates that both logistic regression and gradient boosting models achieve near-perfect accuracy in detecting national influenza outbreaks using U.S. CDC ILINet surveillance data from 2010 to 2025, validating the operational utility of framing early-warning as a threshold-based binary classification problem.

Onwuameze, C. N., Madu, V.2026-03-13📄 health informatics

Self-Reported Side Effects of Semaglutide and Tirzepatide in Online Communities

This study analyzed over 410,000 Reddit posts to reveal that while gastrointestinal symptoms are the most common self-reported side effects of semaglutide and tirzepatide, online communities also highlight emerging concerns like reproductive and temperature-related issues that are often underrepresented in clinical trials and official labeling.

Sehgal, N. K. R., Tronieri, J. S., Ungar, L. + 1 more2026-03-13📄 health informatics

The Orphanet Nomenclature and Classification of rare diseases: a standard terminology for improved patient recognition and data interoperability

This paper presents the Orphanet Nomenclature and Classification system as a comprehensive, multilingual, and interoperable standard for rare diseases, detailing its July 2025 update which includes nearly 10,000 clinical entities with extensive mappings to major medical terminologies to facilitate global data sharing, accurate patient identification, and improved healthcare outcomes.

Lucano, C., Lagorce, D., Olry, A. + 14 more2026-03-12📄 health informatics